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BOOTH ENCODING CIRCUIT FOR A MULTIPLIER OF A 
MULTTPLY-ACCUMULATE MODULE 
[0001] The present application claims priority from U.S. Provisional Patent Application 

No. 60/269,450, entitled "A Low Power and High performance Multiply Accumulate (MAC) 
5 Module," the disclosure of which is incorporated herein by reference in its entirety. 
BACKGROUND OF THE INVENTION 
Field of the Invention 

[0002] The invention relates generally to the field of multiply accumulate modules. 

More specifically, the invention is directed towards a Booth encoding circuit within a multiplier 
10 of a multiply-accumulate module. 

Description of Related Art 
O [0003] Known digital systems may multiply two binary numbers, a multiplier binary 

J number and a multiplicand binary number, in a manner similar to the multiplication of two 

Hi decimal numbers. Specifically, such systems may examine successive bits of the multiplier, 

y I 

«15 beginning with the least significant bit of the multiplier. When the examined multiplier bit is a 

y3 

l?| "1," a multiplicand binary number is copied in its entirety to produce partial products. When the 
!L multiplier bit is a "0," zeros are copied to produce partial products. Moreover, the total number 
ill of copied zeros is equivalent to the number of bits of the multiplicand binary number. The 
Pi partial products in successive lines then may be shifted one position to the left relative to the 
p20 previous line. When all of the multiplier bits have been examined, the partial products in 
successive vertical lines are summed to produce the resultant product of the multiplier binary 
number and the multiplicand binary number. Nevertheless, in order to complete the 
multiplication process, the above-described multiplication process generates a partial product for 
each multiplier bit. 

25 [0004] Some known digital signal processors (DSP's) may be equipped with a multiply- 

accumulate module for the execution of such multiplication of the multiplier and the 
multiplicand. The speed of the multiply-accumulate module may affect the operation frequency 
of the DSP, such that increasing the speed of the multiply-accumulate module also may increase 
the operation frequency of the DSP. Some known multiply-accumulate modules may include a 

30 multiplier. In some known multiply-accumulate modules, the multiplier may account for about 
half of the delay in the critical path of the multiply-accumulate module. Consequently, 
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increasing the speed the multiplier substantially may increase the overall speed of the multiply- 
accumulate module. 

[0005] In order to decrease the number of partial products generated during the 

multiplication of the multiplier binary number and the multiplicand binary number, some known 
5 multipliers may employ a Booth encoding algorithm or method. Reducing the number of partial 
products may increase the speed of the multiplier. In order to reduce the number of partial 
products, a known Booth encoding algorithm may record a radix-2 multiplier Y into a radix-4 
multiplier Z with an encoded digital set, {-2, -1, 0, 1, 2}, such that the number of partial products 
may be reduced by one half. 
10 [0006] A Booth encoding circuit may be designed to employ such a Booth encoding 

algorithm. Referring to Fig. 1, a known Booth encoding circuit 100 is shown. Booth encoding 
circuit 100 may comprise a plurality of cells 102a-102d and a plurality of inputs Y 2n , Y 2n +i, and 
J Y 2n -b which may be the 211 th , 2n th + 1 , and 2n th - 1 bits of a multiplier Y, respectively. Inputs Y 2tl 

M and Y 2t i-i each may be connected to cells 102b and 102c, and input Y 2n +i may be connected to 

y i 

III 5 cells 102a, 102c, and 102d. Cell 102c may comprise a first switch 103 a and a second switch 
J|. 103b, each of which may be connected to an input of a NAND gate 104. An output of NAND 
^ gate 104 then may be connected to an output inverter 106 of cell 102c, such that the output of 
O cell 102c is located at node 1. In addition, the output of inverter 106 may be indirectly connected 
m to a Booth encoding circuit 100 output inverter 108, such that the output of Booth encoding 
g|0 circuit 100 is located at a node 2. 

U [0007] Moreover, Booth encoding circuit 100 also comprises a plurality of transistor 

paths between each of the inputs and the output of Booth encoding circuit 100. A path is defined 
herein as the electrical route over which a particular input signal must travel in order to reach the 
output of a Booth encoding circuit. A transistor stage level for a particular path is defined herein 

25 as the number of transistors through which an input flows in order to reach the output of a Booth 
encoding circuit. Booth encoding circuit 100 further may comprise at least one "critical 
transistor stage path," which is defined herein as the path or paths having the greatest transistor 
stage level between an input and the output of Booth encoding circuit 100. 
[0008] In Booth encoding circuit 100, the critical path flows through cell 102c, such that 

30 the critical path comprises NAND gate 104 and inverter 106. The output signal of inverter 106 
maybe logically expressed as inverter 106 output = (Y 2n+ i ® Y 2n ) * (Y 2n -i © Y 2n ), where © is the 
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logic symbol for an exclusive OR (XOR) logic gate, * is the logic symbol for an AND logic gate, 
and the bar is the logic symbol for NOT. Moreover, because NAND gate 104 is connected 
between inverter 106 and each of switches 103a and 103b, NAND gate 104 drives inverter 106 at 
least for the critical path of Booth encoding circuit 100. In addition, NAND gate 104 is cascaded 
5 with a plurality of p-channel transistors and a plurality of n-channel transistors, such as 
transistors 110a- 11 Oh. Because inverter 106 may drive a number of cells in a Booth decoder 
circuit (not shown) of the multiplier, as a load capacitance increases, the driving efficiency of 
cascaded n-channel and p-channel transistors connected to a NAND gate may be reduced, such 
that the speed of cell 102c also may be reduced. 
10 SUMMARY OF THE INVENTION 

[0009] Therefore, a need has arisen for compressors that overcome these and other 

shortcomings of the related art. A technical advantage of the present invention is that the speed 
| of the critical path cell of the Booth encoding circuit may be increased by driving the output 
inverter of the critical path cell with a second inverter. Specifically, the output inverter of the 
L[l5 critical path cell may be driven by the second inverter via a switch, which may be positioned 
between the inverters. 

[0010] According to an embodiment of the present invention, a Booth encoding circuit 

O comprises a plurality of cells, in which at least one of the cells comprises a plurality of inputs. 
S The cell also comprises a first plurality of transistors, which form a first logic stage, in which at 
|lo least one of the inputs is connected to at least one of the first plurality of transistors. The cell 
U further comprises a second plurality of transistors, which form a second logic stage, in which at 
least one of the inputs is connected to at least one of the second plurality of transistors. The cell 
also comprises a first output inverter connected to at least one of the second plurality of 
transistors, and a first switching means connected to at least one of the first plurality of 
25 transistors. The cell further comprises a second switching means connected to the first output 
inverter, and a second output inverter connected to the first switching means and the second 
switching means. Moreover, within a critical path of the Booth encoding circuit, the first output 
inverter drives the second output inverter. In one embodiment, the first output inverter drives the 
second output inverter via the second switching means. 
30 [001 1] According to another embodiment of the present invention, a multiplier comprises 

a Booth encoding circuit, which comprises a plurality of cells, in which at least one of the cells 



111 
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comprises a plurality of inputs. The cell also comprises a first plurality of transistors, which 
form a first logic stage, in which at least one of the inputs is connected to at least one of the first 
plurality of transistors. The cell further comprises a second plurality of transistors, which form a 
second logic stage, in which at least one of the inputs is connected to at least one of the second 
plurality of transistors. The cell also comprises a first output inverter connected to at least one of 
the second plurality of transistors, and a first switching means connected to at least one of the 
first plurality of transistors. The cell further comprises a second switching means connected to 
the first output inverter, and a second output inverter connected to the first switching means and 
the second switching means. Moreover, within a critical path of the Booth encoding circuit, the 
first output inverter drives the second output inverter. In one embodiment, the first output 
inverter drives the second output inverter via the second switching means. 
[0012] Other features and advantages will be apparent to persons of ordinary skill in the 

art in view of the following detailed description of the invention and the accompanying 
drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0013] For a more complete understanding of the present invention, needs satisfied 

thereby, and the features and advantages thereof, reference now is made to the following 
descriptions taken in connection with the accompanying drawings. 
[0014] Fig. 1 is a schematic of a known Booth encoding circuit. 

[0015] Fig. 2 is a schematic of a Booth encoding circuit according to an embodiment of 

the present invention. 

[0016] Fig. 3 is a schematic of a transfer gate switch according to an embodiment of the 

present invention. 

[0017] Fig. 4 is a flow-chart of a parallel multiplier according to an embodiment of the 

present invention. 

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 

[0018] Preferred embodiments of the present invention and their advantages may be 

understood by referring to Figs. 2-4, like numerals being used for like corresponding parts in the 
various drawings. 

[0019] Referring to Fig. 4, a flow-chart of a parallel multiplier 400 according to an 

embodiment of the present invention is described. Parallel multiplier 400 may comprise a 
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multiplier register 402, a multiplicand register 406, a result register 418, and a multiply- 
accumulate module 420, which may comprise a multiplier 419. Multiplier register 402 may 
comprise a first binary number and multiplicand register 406 may comprise a second binary 
number. For example, the first binary number may be a 17 bit number and the second binary 
5 number also may be a 17 bit number. Moreover, multiplier 419 may multiply the first binary 
number and the second binary number. Multiplier 419 may comprise a Booth encoder circuit 
200, a plurality of pre-processing cells 408, any known Booth decoder circuit 410, any known 
Wallace tree circuit 412, any known adder circuit 414, and any known saturation detection 
circuit 416. The possible structures of such known Booth decoders; Wallace trees; adders; and 
10 saturation detectors, respectively, will be readily understood by those of ordinary skill in the art. 
Moreover, those of ordinary skill in the art will understand that parallel multiplier 400 may 
employ any known Booth encoder; Booth decoder; Wallace tree; adder; and saturation detector, 
-3 respectively. Therefore, such structures will not be discussed in detail. 

[0020] Multiplier register 402 may be connected to Booth encoder 200, which may be 

U15 connected to Booth decoder 410. Multiplicand register 406 may be connected to each pre- 
jj processing cell 408, such the second binary number may be pre-processed in each pre-processing 
W cell 408. In addition, each pre-processing cell 408 may be connected to Booth decoder 410. 
Booth decoder 410 may be connected to Wallace tree 412, which may be connected to adder 
414, such that multiplier 419 may multiply the first binary number and the second binary 
CIO number. Moreover, adder 414 may be connected to saturation detector 416, which may be 
U connected to result register 41 8, such that the product of the first binary number and the second 
binary number may be stored in result register 418. 

[0021] Result register 418 further may be connected to Wallace tree 412, such that a 

product of a subsequent first binary number and a subsequent second binary number may be 

25 added to the previous output stored in result register 418, Le., the product of the first binary 
number and the second binary number. As such, the previous output stored in result register 418 
may be replaced by a new output from multiply-accumulate module 420. Moreover, the new 
output from multiply-accumulate module 420 stored in result register 418 may be expressed as 
An = An-1 + Xi*Yi, where An-1 is the output from multiply-accumulate module 420 previously 

30 stored in result register 418, Xi*Yi is the product of the first binary number and the second 
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binary number being multiplied by multiply-accumulate module 420, and An is the new value 
stored in result register 418, which replaces An-1. 

[0022] Referring to Fig, 2, in each of the above-described embodiments of the present 

invention, Booth encoder 200 may comprise a plurality of cells 202a-202d, and a plurality of 
5 inputs Y 2n , Y 2n+l , and Y 2n -i, which may be the 2n th , 2n th + 1, and 2n th - 1 bits of a multiplier Y, 
respectively. Inputs Y 2n and Y 2n _i each may be connected to cells 202b and 202c, and input Y 2n +i 
may be connected to cells 202a, 202c, and 202d. Cell 202c may comprise a first plurality of 
transistors 203, which may comprise transistors 204, 206, 208, and 210. 
[0023] Transistors 204 and 206 each may be a p-channel type transistor, such as a p- 

10 channel type metal oxide semiconductor (MOS) transistor, or the like. Transistors 208 and 210 
each may be a n-channel type transistor, such as a n-channel type MOS transistor, or the like. 
Moreover, in one embodiment, input Y 2n may be connected to transistors 204 and 208, and input 
'% Y 2n _i may be connected to transistors 206 and 210. A source of transistor 204 may be connected 
O to a high potential V d a. Moreover, a drain of transistor 204 may be connected to a drain of 
!|f 5 transistor 206 and a drain of transistor 208, such that the drain of transistor 206 also may be 
% connected to the drain of transistor 208. The drain of transistor 204, the drain of transistor 206, 
W and the drain of transistor 208 may form a node 3, which may be an output of first plurality of 
O transistors 203. In addition, a source of transistor 208 may be connected to a drain of transistor 
|* 210, and a source of transistor 210 may be connected to a low or ground potential. 

GEO [0024] In operation, when input signals Y 2n and Y 2n _i each are high signals, ie., may be 

Pi 

2 logically expressed as a binary "1" signal, transistors 204 and 206 each may be in a non- 
conductive state. Similarly, transistors 208 and 210 each may be in a conductive state. 
Consequently, a drain potential of transistor 208 may be a low potential, and the output at node 3 
may be a low potential. When input signals Y2 n and Y 2n .i each are low signals, Le., may be 

25 logically expressed as a binary "0" signal, transistors 204 and 206 each may be in a conductive 
state. Similarly, transistors 208 and 210 each may be in a non-conductive state. Consequently, a 
drain potential of transistor 208 may be a high potential, and the output at node 3 may be a high 
potential. When input signal Y 2n is a high signal and input Y 2tl -i is a low signal, transistors 206 
and 208 each may be in a conductive state. Similarly, transistors 204 and 210 each may be in a 

30 non-conductive state. Consequently, the drain potential of transistor 206 may retain its high 
potential and the output at node 3 may be a high potential. When input signal Y 2n is a low signal 
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and input Y 2n -i is a high signal, transistors 206 and 208 each may be in a non-conductive state. 
Similarly, transistors 204 and 210 each may be in a conductive state. Consequently, the drain 
potential of transistor 204 may retain its high potential and the output at node 3 may be a high 
potential. Because the output at node 3 may be a low potential when input signals Y 2n and Y 2n -i 
each are high signals, and may be high potential for all other combinations of input signals Y 2n 
and Y 2n -i, first plurality of transistors 203 may be a NAND gate logic stage. 
[0025] Cell 202c also may comprise a second plurality of transistors 211, which may 

comprise transistors 212, 214, 216, and 218. Transistors 212 and 214 each may be a p-channel 
type transistor, such as a p-channel type metal oxide semiconductor (MOS) transistor, or the like. 
Transistors 216 and 218 each may be a n-channel type transistor, such as a n-channel type MOS 
transistor, or the like. Moreover, in one embodiment, input Y 2n may be connected to transistors 
212 and 216, and input Y 2n _i may be connected to transistors 214 and 218. Moreover, a source 
of transistor 212 may be connected to a high potential V d d, and a drain of transistor 212 may be 
connected to a source of transistor 214. A drain of transistor 214 may connected to a drain of 
transistor 216, and also may be connected to a drain of transistor 218, such that the drain of 
transistor 214, the drain of transistor 216, and the drain of transistor 218 may form a node 4. 
Moreover, node 4 may be an output of second plurality of transistors 211. In addition, a source 
of transistor 216 and a source of transistor 218 may be connected to a low or ground potential. 
In operation, the second plurality of transistors 21 1 may comprise an NOR gate logic stage. The 
output at node 4 may be connected to a first output inverter 222, such that the output of first 
output inverter 222 may be logically expressed as a OR signal. 

[0026] Cell 202c further may comprise a first switching means 224 and a second 

switching means 226. First switching means 224 may comprise a first transfer gate switch 238, 
which is illustrated symbolically and by circuit diagram in Fig. 3, having an input A and an 
output B. Similarly, second switching means 226 may comprise a second transfer gate switch 
238. The output signal at node 3 may be connected to an input of first switching means 224 and 
the output signal of first output inverter 222 may be connected to an input of second switching 
means 226. Input signal Y 2n+ i may be connected to switching means 224 and also may be 
connected to switching means 226. Input signal Y 2n+ i further may be connected to an input 
inverter 220, which may be connected to first switching means 224 and second switching means 
226. The output of first switching means 224 and second switching means 226 may be 
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connected to a second output inverter 228. Moreover, the signal at second output inverter is the 
output signal of cell 202c. Moreover, the output signal of second output inverter 228 may be 
logically expressed as Y 2n +i * (Y 2n * Y 2n -i) + Y 2n +i * (Y 2n + Y 2n -i). 

[0027] In each of the above-described embodiments of the present invention, Booth 

5 encoder 200 further may comprise a transistor 230, a transistor 232, a transistor 234, and a Booth 
output inverter 236. The output signal of Booth output inverter 236 may be the output signal of 
Booth encoding circuit 200. Booth encoding circuit 200 also may comprise a plurality of 
transistor paths between each of the inputs and the output of Booth encoding circuit 200. A path 
is defined herein as the electrical route over which a particular input signal must travel in order 
10 to reach the output of a Booth encoding circuit. A transistor stage level for a particular path is 
defined herein as a number of transistors through which an input flows in order to reach the 
output of a Booth encoding circuit. Booth encoding circuit 200 further may comprise at least 

Ipsa 

43 one "critical transistor stage path," which is defined herein as the path or paths having the 
JfJ. greatest transistor stage level between an input and the output of Booth encoding circuit 200. 

y r. 

IH 5 For example, input signal Y 2n may flow through transistor 212, transistor 214, first output 
In inverter 222, second switching means 226, second output inverter 228, transistor 230, transistor 
■ 232, transistor 234, and Booth output inverter 236, Thus, within cell 202c, the transistor stage 
O level for this particular path (A) is five, and within Booth encoding circuit 200, the transistor 
Ijl stage level for this particular path (A) is nine. Moreover, because no other paths within Booth 
C|0 encoding circuit 200 may have a transistor stage level within Booth encoding circuit 200 which 
\& is greater than nine, path A also may be a critical transistor stage path for Booth encoding circuit 
200. 

[0028] Moreover, within critical transistor stage path (A) of Booth encoding circuit 200, 

second output inverter 228 is driven by first output inverter 222 for at least the critical transistor 
25 stage path of Booth encoding circuit 200. In one embodiment, second output inverter 228 is 
driven by first output inverter 222 for at least the critical transistor stage path of Booth encoding 
circuit 200 via switching means 226. 

[0029] For example, an embodiment of the present invention as shown in Fig. 2 was 

constructed. In this embodiment of the present invention, the speed of cell 202c was 10% faster 
30 than the speed of cell 102c, as shown in Fig. 1. Moreover, in this embodiment of the present 
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invention, an amount of power consumed by cell 202c was 8% less than an amount of power 
consumed by cell 102c, as shown in Fig. 1. 

[0030] While the invention has been described in connecting with preferred 

embodiments, it will be understood by those of ordinary skill in the art that other variations and 
modifications of the preferred embodiments described above may be made without departing 
from the scope of the invention. Other embodiments will be apparent to those of ordinary skill in 
the art from a consideration of the specification or practice of the invention disclosed herein. It 
is intended that the specification and the described examples are considered as exemplary only, 
with the true scope and spirit of the invention indicated by the following claims. 
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